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1. INTRODUCTION 

Animal behavior has always been a source of study and amazement for mankind [1], [2]. Flocking is 
a behavior in which several objects work or move together as coherent entities [3] or behave in a unified manner 
while changing their direction and shape [4]. Examples of animal flocking behavior include birds (flies in 
swarms), sheep (moving as herds), and fish (swimming in schools) [5]. In recent years, studying and 
understanding animal flocking behavior through simulations has attracted researchers from various scientific 
and engineering fields [6]. Simulating flocking behavior has been widely studied in computer graphics [3], 
games [7], [8], mobile networks [9], road design [10] and education [11], [12]. Such simulations enable users 
to test complex scenarios that have no impact on a real environment. To produce flocking behavior in groups 
of computer characters (objects), three rules in section 2.1 were applied to all objects in the group with 
convincing flocking behavior results: Alignment (i.e. steer to mean heading of mates in a flock), Cohesion (i.e. 
steer to mean position of flock mates), and Separation (i.e. steer to avoid crowded mates) [3], [8], [13]. 

As the object arrangement in a flock change constantly, every object in the flock must update its view 
by cycling through all objects in the flock to collect the required data for each object. This is computationally 
expensive, particularly when the number of objects rapidly increases. Simulating large numbers of objects in 
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real time is a challenging task, particularly when the objects are realistically separated, and collisions are 
avoided. This requires the development of intelligent approaches to facilitate the separation of flocking 
behavior. A common method for approaching the separation rule is to use the nearest neighbor (NN) 
algorithm [4]. This algorithm calculates the distance between object x and every other object in the training 
set. The NN algorithm sorts the calculated distances and then selects the k samples closest to the object x [14]. 
However, the complexity of the NN algorithm was O(n”). This is because of the separation rule in which the 
distance of every object from other objects in the entire flock is calculated. Such high complexity makes it 
inefficient for use in modern applications, such as games [15], crowd mobility [16], mobile network 
mobility [17], and arial robots [18]. Therefore, an optimized NN is required to reduce complexity when 
performing the separation rule for flocking behavior. 

The work in [4] utilized the k-nearest neighbor (KNN) method to compute the nearest neighbors in 
Flocking Behavior, where the nearest object is found by the majority of flock objects within k adjacent 
neighbors. However, two key issues are related to KNN [19]. The first concerns choosing parameter k, a user- 
predefined parameter that represents the number of nearest neighbors. A straightforward method for 
determining the best k, is cross-validation (i.e., trying several k values and choosing k with the highest 
performance). The second issue with the KNN is its high computational complexity for large sets of objects. 
Therefore, the work in [19] introduced the extended nearest neighbor (ENN) method. Unlike the classification 
method in KNN, in which information about nearby neighbors is considered, the ENN method considers 
information from all available objects. The performances of the KNN and ENN methods were discussed 
in [20]. The results showed that the ENN method is less prone to errors, although both have a high 
computational complexity for large sets. The partial distance (PD) algorithm proposed in [21] uses a premature 
exit condition during the search process to terminate the distance calculation at an early stage. However, its 
efficiency depends on the initial distance (i.e. itis less useful for larger distances). For this purpose, an efficient 
method for identifying initial distances to the PD algorithm was proposed in [22]. An NN method based on PD 
was proposed in [23] to provide an enhanced version of the partial distance approach (EPD). 

This paper presents an enhanced version of the partial-distance approach proposed in [23] to solve the 
flocking fish problem. We propose an efficient NN method based on the k-dimensional trees (KD trees) 
algorithm. The KD trees algorithm is a multidimensional binary tree, which is a specific storage structure that 
efficiently represents training data [14]. The proposed approach, together with other conventional NN methods, 
was applied using a Unity-3D game engine. Unity-3D allows users to use a set of powerful animation and 
interaction design tools to visually program and animate flocking behavior. The experimental results show that 
the performance of the proposed KD trees is better than that of the conventional NN methods. The remainder 
of this paper is organized as follows. Section 2 provides information related to the flocking simulation together 
with the main algorithms used in this paper. Section 4 describes the implementation of the proposed method. 
Section 5 presents the results, and section 6 concludes the paper. 


2. METHOD 

An efficient NN method based on the KD trees algorithm was proposed in this paper. Flocking 
simulation models and main classification algorithms were presented to better understand the research context. 
We also describe the implementation of the proposed method using Unity-3D game engine. 


2.1. Flocking simulation models 

Simulations are important for understanding flocking behavior [5]. The first flocking-behavior 
simulation (called ‘Birds’) was performed in 1986 by Craig Reynolds [3]. The simulation program was made 
after the simple agents in the system. Three conducts [24] (i.e. steering rules) were used in the simulation to 
determine how birds would move, namely cohesion, alignment, and separation. Another flocking simulation 
model was proposed by Heppner in 1990 [25]. This model consists of three conducts: Homing (every object 
aims to stay in the roosting area), Velocity regulation (every object aims to fly at a particular predefined flight 
speed, which aims to return to that speed if perturbed), and Interaction (two flock entities try to move apart 
when they are too close to one another). This paper focuses on the first simulation model proposed by 

Reynolds [3]. The Reynolds model is the simplest model for simulating animal behavior, such as that of birds. 

It is the most widely used simulation model [5]. Figures 1(a) to 1(c) illustrates the three rules of Reynolds’ 

model [8]. 

— Cohesion rule: Figure 1(a) [8] shows the cohesion rule, which implies that all flock objects remain together 
in a group without going on a separate way away from the group. To obtain this rule, every object must 
steer towards the average position of its neighbors in the flock (i.e., steer towards the center of a mass). 
Assume we have N objects (0,, 02 , ..., Oy) and the position of an object is denoted by o. position, then the 
‘center of mass' c of all N objects is given by (1). 
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c = (0,.position + 02.position +++ Oy.position)/N (1) 


— Alignment rule: Figure 1(b) [8] illustrates the alignment rule, which implies that all objects in a flock 
generally head in the same direction. Every object in the flock must steer towards a heading that is 
determined by the unit's velocity vector [5] and is equal to the average heading of the object’s neighbors. 
Normalizing the velocity vector for every unit gain unit heading vector makes the steering force a linear 
function of the angle between the current unit heading and the average heading of its neighbors. 

— Separation rule: Figure 1(c) [8] shows the separation rule that aims to avoid collisions between objects. 
Although objects may try to get closer to each other because of the alignment and cohesion rules, the 
separation rule aims to maintain objects at a minimum distance from each other. 


(a) (b) (c) 


Figure 1. Shows three flocking rules: (a) cohesion, (b) alignment, and (c) seperation [8] 


2.2. Nearest neighbor 

The NN algorithm is a well-known supervised machine learning algorithm used for classification and 
regression problems in different contexts. An NN was proposed in 1967 by Cover and Hart [26]. The 
identification of the nearest neighbors depends on the distance between the tested and training data samples. 
Given a query vector Xo and a set of N labeled instances, the aim is to predict the class label of xg on a 
predefined set of classes. One way to find the NN efficiently is to conduct a three-dimensional bucket sort and 
then check the neighbors’ adjacent buckets. The bucket positions of the objects with excessive deviation from 
the center of the bucket were adjusted incrementally as the buckets were transformed along with the flock. The 
bucket size involves a time-space trade-off: the smaller the buckets, the more buckets are needed, but the fewer 
members per bucket on average. Nevertheless, the O(n”) problem is not completely eliminated because of the 
worst-case distributions [4], [5], [27]-[29]. 


2.3. KD trees 

The KD trees algorithm is a multidimensional generalization of binary trees designed to handle spatial 
data in a simple manner [14], [30]. The root of the KD trees represents the k-dimensional space that contains a 
set of data points. Every node is a subset of the data points. Each nonterminal node has two child nodes, left 
and right, which are obtained by partitioning the subset of the node into two parts using a hyperplane orthogonal 
to one of the k coordinates axes (called the discriminating axis). The position of the hyperplane was selected 
such that each child contained approximately half of the parent’s data points. The discriminating axis is 
determined by selecting the most dispersed coordinates. Each terminal node defines a cell (called a bucket) in 
a multidimensional space containing a subset of data points with bucket size less than a predefined threshold. 
To find the nearest neighbor for a query point (q), we first find a bucket that contains q by traversing the tree. 
We then find the nearest point to q in the bucket. This will result in finding a “candidate” nearest neighbor, but 
there is still the possibility of a nearest point. Therefore, we traverse backward and start finding the distance 
from the other points in the neighboring boxes. The point with the nearest distance to q is the nearest neighbor 
that we are looking for. The KD trees data structure provides an effective method for examining the objects 
closest to the query objects, thereby significantly reducing the computation required to find the best 
matches [30]. The complexity of the KD trees is O(K log n) (expected) lookup times for the K nearest objects 
to a query object [31], [32]. Based on [32], the object nearest to the query object is determined by applying the 
following steps, 
— Start by exploring the bucket that contains the query object. 
— Compute distance to each other objects at bucket. 
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— Backtrack and find the nearest neighbor to the query object by finding the point with the shortest distance 
in adjacent buckets. 


2.4. Implementation 

Computer game developers usually face the problem of developing games that are fun, without having 
to program directly in low-level languages [33]. For this, game engines with graphical editors were developed 
not only to create visual artwork but also to animate it and scripting interactions with users. Several game 
engines have recently become available for educational and research purposes. Among several game engines, 
such as unreal engine, CryEngine, and Unity-3D, we utilized the latter in this research. In [8], de Byl and Penny 
used a Unity-3D game engine with the C# programming language to develop a computer game that simulated 
fish flocking behavior. Figure 2 shows a flock of 50 fish simulated by [8]. The figure shows a clear separation 
between the fish. 


Figure 2. A flock of 50 fish simulated by [8] 


Fish flocking in [8] was implemented using the Unity-3D game engine. Subsequently, we implement 
our approach to enhance the algorithm [8]. The Unity-3D game engine provides important features for studying 
flocking behavior, such as an easy-to-use graphical user interface for animating objects and controlling the 
interaction between objects. Furthermore, the Unity-3D game engine has a strong professional community that 
supports its progress, and is freely available for education and research. Figure 3 shows a snapshot (in C#) of 
how the NN computes the distances between fish objects. The figure shows that we have a value for the 
neighbor distance, which is the maximum distance that the fish need to be in order for them to flock. If they 
are outside this distance from each other, they will not take notice of each other, so they will only go to flock 
if they are near each other. To apply the separation rule, each fish needs to know all the other fish. An avoidance 
vector (vavoid) was created to avoid any nearby body. If we obtain within a small distance (1.0 unit), we want 
to avoid it; therefore, we calculate the (vavoid) vector as a vector that is facing off in the other direction. Our 
focus is on the (Vector3.Distances) function in Figure 3, which computes distances between fish 60 times per 
second with a complexity of O(n”). However, this is a time-consuming process. To reduce this time, we applied 
KD trees with a complexity of O(K logn) lookup times for the K nearest objects to a query object. 


1 GameObject[] fish; 

2 float neighborDistance = 2.0f 
3 float dist; 

4 foreach (GameObject go in fish) 
s { 

6 if(go != this.gameObject) 

7 { 


dist = Vector3.Distance(go.transform.position, this.transform.position) ; 


9 if (dist <= neighborDistance ) 

16 

11 if (dist < 1.0f) 

12 

13 vavoid = vavoid + (this.transform. position - go.transform. position) ; 
14 } 

15 } 

16 } 

a7 } 


Figure 3. A snapshot of how the NN works to compute the distances between fish objects 
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3. RESULTS AND DISCUSSION 

To examine the effectiveness of the KD trees approach, fish flocking in [8] was implemented together 
with the conventional NN [19], PD [22] and EPD [23] approaches using the Unity-3D game engine. The frames 
per second (FPS) were measured using different numbers of objects. Figures 4(a) and 4(b) show screenshots 
of the simulation results for the conventional nearest neighbor NN and the KD trees approach, respectively, 
when the number of objects is N=100 (minimum number of objects). The performance of the conventional NN 
in Figure 4(a) provided a lower FPS (111.2) than that of the KD trees (123.6 FPS), as shown in Figure 4(b). 
Furthermore, Figures 5(a) and 5(b) show screenshots of the simulation results for the conventional nearest 
neighbor NN and the KD trees approach, respectively, when the number of objects is N=1000 (the highest 
number of objects). The Results show that the performance of the conventional NN (9.8 FPS) in Figure 5(a) is 
much lower than that of the KD trees approach (102.2 FPS), as shown in Figure 5(b). 


— ee 
(a) (b) 


Figure 4. Simulation results representing FPS values when the number of objects (N=100) for: 
(a) the conventional NN approach and (b) the proposed KD approach 


(a) (b) 


Figure 5. Simulation results representing FPS values when the number of objects (N=1000) for: 
(a) the conventional NN approach and (b) the proposed KD approach 


Figure 6 shows the graphical performance of all the methods (average results of 10 runs). The results 
show that the performance of the KD trees approach proposed in this study is better than that of other 
approaches. The KD trees approach provided higher FPS rates compared to NN, PD, and EPD, while increasing 
the number of objects. The FPS using the KD trees approach remained at the same level for 100 and 200 objects 
and then slightly decreased as the number of objects increased, whereas the FPS rates for other approaches 
dropped at 200 and 600 object sizes. This indicates that the KD trees approach provides a more stable and 
consistent gaming quality. Furthermore, the NN, PD, and EPD approaches provided FPS rates lower than 60 
compared with the KD trees, which maintained FPS rates above 80, while the number of objects increased. 
This shows that using KD trees is better for interactive games compared to other approaches that can suffer 
from noticeable motion "flickering" artifacts. To test whether there was a significant difference between the 
performance of the KD trees approach and other approaches, the Mann-Whitney U test was applied in Table 
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1. Notably, the difference between the KD method and NN, PD, and EPD approaches was significant (P<0.05), 
and the difference was higher than that of the EPD method (P<0.05); this difference was positive (i.e. median 
of KD > medinas of NN, PD, and EPD). 


FRAME PER SECOND 


100 200 300 400 500 600 700 800 900 1000 
NUMBER OF OBJECTS 


—e—NN —&—PD —e—EPD —e—KD 


Figure 6. Graphical results (FPS) of the fish flock 


Table 1. Statistically significant differences of the performance values in Figure 6 (Mann-Whitney, 1-tail, 
Na=Nb=10, Median of KD=92.5) 


Difference in performance values _Z-score Pp Median values 
KD > NN 3.25 P<0.001 NN=42 
KD > PD 3.25 P<0.001 PD=S0 
KD > EPD 2.985 P<0.05 EPD=57.5 


Furthermore, the Pearson correlation coefficient was applied to test the correlation between the 
number of objects and FPS for each approach in Table 2. The results showed a strong negative correlation 
between each of the given flocking methods and the number of objects in the training set (-1<R<0). Hence, the 
performance of the algorithms (i.e., FPS) is influenced by the number of objects in the flock. 


Table 2. Pearson correlation coefficient R-values 
Approach R-value 
No of objects > KD -0.95 
No of objects > NN -0.97 
No of objects > PD -0.95 
No of objects > EPD -0.95 


4. CONCLUSION 

Virtual reality has been used in learning for the last two decades, such as immersive technology that 
focuses on increasing interaction with the world to aid learners in learning new knowledge. Learners can create 
scenarios that are not possible in the real world and test the results. For example, learners interested in the fish 
industry can use digital technologies to create scenarios to better understand and learn how fish behave in 
flocks. Flocking behaviors have been applied in different yet related research fields such as gaming and 
computer vision. However, utilizing a large number of objects in a real-time simulation environment is highly 
demanding in terms of the computing power. This is because of the high NN complexity (n squared) required 
to separate the n objects. We propose an efficient flocking algorithm and its application to fish flocking 
behavior. The KD trees method for computing the nearest neighbors was presented. The experimental results 
showed that the proposed algorithm provided better results than other algorithms when applied to the flocking 
fish problem. To evaluate the proposed approach, we applied it together with other conventional NN methods 
using the Unity-3D game engine, which allows users to visually program and animate flocking behavior. The 
results demonstrate that the proposed approach outperforms the conventional NN approaches. 
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